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ABSTRACT 

Many analyses of microwave background experiments neglect the correlation of noise in 
different frequency or polarization channels. We show that these correlations, should they 
be present, can lead to severe misinterpretation of an experiment. In particular, correlated 
noise arising from either electronics or atmosphere may mimic a cosmic signal. We quantify 
how the likelihood function for a given experiment varies with noise correlation, using both 
simple analytic models and actual data. For a typical microwave background anisotropy 
experiment, noise correlations at the level of 1% of the overall noise can seriously reduce 
the significance of a given detection. 
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1. Introduction 

The last few years have witnessed a surge of experiments measuring anisotropies in the 
cosmic microwave background. The existence of such anisotropies is now firmly established; 
measurements are becoming plentiful enough to compare different cosmological theories 
quantitatively. A useful interpretation of a given experiment relies on proper treatment of 
atmospheric and instrumental noise. This Letter focuses on one possible pitfall in analyzing 
experimental results: noise in different channels of an experiment may be correlated. This 
correlated noise may mimic a cosmological signal on the sky and significantly alter the 
interpretation of an anisotropy measurement. 

All present anisotropy experiments share several common features. Typically, an 
experiment measures the deviation of the microwave background temperature from its 
mean value; this deviation is the temperature anisotropy. Measurements are usually taken 
at several different frequencies and sometimes at different polarizations for a total of A c 
measurements at a given point on the sky. This set of measurements is then repeated in 
N p patches on the sky. 

The analysis of this type of experiment requires the correlation function, which in- 
cludes the expected contribution to the signal from cosmological sources, instrumental 
and atmospheric noise, and foreground sources. In the present work, we ignore the last 
contribution; foreground sources are considered in detail elsewhere (Brandt et al. 1993; 
Dodelson and Stebbins, 1993; Dodelson and Kosowsky, 1994). The correlation function 
and the data determine the likelihood function (see, e.g. Readhead et al, 1989; Bond et 
al. 1991), the probability of obtaining the data given a particular theory and the noise 
parameters. Explicitly, the likelihood function is given by 
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where A = N P N C is the total number of data points, C is the A x A correlation matrix, and 
D is a A-component vector containing the data. We are interested in small off-diagonal 
elements in the noise contribution to the correlation matrix. 

The actual value of the likelihood function is not significant, but rather its relative 
value for different potential theories. For the sake of simplicity, we parametrize theories 
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simply by the variance they predict in a given experiment, <r t h; then £ is a function of cr t h- 
A maximum in the likelihood function at a non-zero value of a t h marks a detection; the 
significance of a detection is reflected by the ratio of the value of the likelihood function 
at its maximum to its value at no theoretical signal. We therefore consider the likelihood 
ratio: 



In this Letter we show analytically how the likelihood ratio changes if small off-diagonal 
terms are included in the correlation matrix. We conclude that even small off-diagonal 
correlations can lead to huge changes in the likelihood ratio, and therefore in the significance 
of a detection. This analytical work, in Sections II and III, is useful but perhaps not 
completely convincing, as it involves certain simplifying assumptions. In Section IV we 
present the likelihood ratio for the Saskatoon experiment (Wollack et al. 1993), the only 
measurement of which we are currently aware that reports off-diagonal correlations. The 
difference between including such correlations in the analysis (as the group properly did) 
and neglecting them is shown to be dramatic. The Saskatoon experiment, a ground-based 
apparatus which uses a single HEMT amplifier for each three frequency channels, has 
large noise correlations, but even for experiments with substantially smaller correlations 
the difference can still be very important. 

2. Two Channel Experiment 

In this section we illustrate the importance of off-diagonal correlations with a simple 
example. Consider an experiment which measures the temperature anisotropies in two 
frequency channels at one point on the sky. In the absence of correlated noise, we need 
three pieces of information to analyze such an experiment: (i) the data, D, which in this 
case consists of two numbers, the observed temperature anisotropy in each channel; (ii) 
the theoretical prediction for the expected rms anisotropy, a t h', and (iii) the expected rms 
of the noise, a n . With two frequency channels, the latter two quantities become 2x2 
matrices. The correlation matrix is the sum of these two matrices: 
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The theoretical rms of h appears in every element, because the expected rms due to the 
cosmic signal is the same in every channel: if channel 1 measures a given signal di, channel 
2 is predicted to measure the same value for c?2 in the absence of noise. The theoretically 
expected signal in each channel is therefore correlated. Any experiment will have diagonal 
contributions to the noise; for simplicity we assume the same noise rms in each frequency 
channel. Additional off-diagonal noise components arise whenever the noise sources in 
different frequency channels are correlated. We parametrize the off-diagonal components 
by e and write the total correlation matrix as 



C = ai 
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where x = cr 2 h /a 2 . Correlated noise will most likely arise from the atmosphere or from an 
experiment's electronics. 

We can evaluate directly the likelihood function in Eq. (1) by noting that 
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Then writing the two measurements as D = a n (d + di,d+ efe), where da n is the mean 
measurement of the two channels, the likelihood function is 
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A straightforward calculation shows that C(x) peaks at x = X satisfying 
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if X is greater than zero. This immediately gives the likelihood ratio, defined in Eq. (2) , 

as 

It is instructive to consider a particular limit of Eq. (8). When the noise in each channel 
is completely correlated (e = 1), we expect to get less information from this experiment. 
Instead of two independent channels, we really have only one independent channel. Thus 
the likelihood ratio in this limit should be the same as for a one channel experiment. A 



short calculation shows that for a one channel experiment, the likelihood ratio is equal to 
(l/d) exp[(2cP — l)/2], which is indeed the value of 1Z in Eq. (8) when e = 1. 

We now show that as long as the noise in the two channels is positively correlated, 
the likelihood ratio decreases. Let us define IZo to be the likelihood ratio when noise is not 
correlated (e = 0). Then, Eq. (8) tells us that 
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This ratio is always less than one as long as < e < 2d 2 — 1. The first inequality (e > 0) 
holds when the noise is positively correlated; the second holds for any solution of Eq. (7). 
Thus positively correlated noise reduces the likelihood ratio. This problem is particularly 
acute in cosmic microwave background experiments, since the signal is also completely 
correlated in the different channels; thus if the noise is correlated, we expect the statistical 
significance of detections or upper limits to be weakened. 

Even for this simplistic one-patch, two-channel model, the effect of noise correlations 
can be non-negligible in estimating the significance of a detection. For example, if the 
signal is twice the noise level (d 2 = 4), presently a representative signal-to-noise ratio, and 
if e ~ 0.5 as it is in the Saskatoon experiment, then including the correlation decreases the 
significance of a detection by almost a factor of three. We will now show that the situation 
gets worse with multiple channels and patches. 

3. Generalization 

It is straightforward to generalize the above discussion to allow for many frequency 
and/or polarization channels and many spatial patches. For N c channels, C becomes an 
N c x N c matrix with components 
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Eq. (10) idealizes an actual experiment in two ways: (i) the noise in all the channels is 
assumed equal, so the diagonal components of Cq are equal; (ii) each pair of channels has 
equal correlation, so the off-diagonal components of C\ are all equal. The inverse of C is 



easily obtained by assuming oc 5^ + b and imposing the condition CC 1 = 1 to find 
the constant b and the normalization; the result is 
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The determinant of C is given by 
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which can be proven by induction. Combining these two expressions gives the iV c -cliannel 
likelihood function: 
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which reduces to the previous result for N c = 2. The likelihood function peaks at x = X 
satisfying 

= 1 + N C X + (N c - l)e, (14) 



and the likelihood ratio result generalizes to 
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A further generalization to N p different patches on the sky can be approximated by a 
block diagonal correlation matrix, where each block is an identical iV c -cliannel correlation 
matrix. This approximation simply raises the right side of Eq. (15) to the power N p : 
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where d 2 now represents the mean squared signal-to-noise ratio for all patches. For the 
noise piece of the correlation matrix, this block diagonal ansatz is normally a good approx- 
imation, as noise in different patches is unlikely to be correlated. However taking the full 
correlation matrix to be block diagonal is only an approximation, since the signal is likely 
to be correlated from patch to patch unless the patches are far removed from each other 



compared to the patch size. The error in this approximation does not qualitatively affect 
our arguments. 

For a typical medium angle experiment today, N p ~ 20 and N c > 3. With a signal 
to noise ratio of order two, the argument of the exponential is typically greater than 
200e/(l + 2e). If e is of order 0.5, then the significance of a detection decreases by a factor 
of order 2 10 e -50 ~ 10~ 19 when one accounts for the noise correlations. For e ~ 0.1, the 
significance decreases by ~ 10~ 7 , and noise correlated even at the 1% level may reduce the 
likelihood ratio by nearly a factor of 10. In general, we expect correlations to be important 
roughly when 



As the signal-to-noise ratio increases, the effect of correlations on the likelihood ratio 
becomes stronger, although the likelihood itself becomes more sharply peaked. 

4. Saskatoon Experiment 

In deriving the analytic result in Eq. (16) , we made three assumptions: (1) The 
noise in each channel and patch was assumed to have the same variance; (2) The noise was 
assumed to be correlated in the same way between any pair of channels; (3) The signal 
was assumed to be uncorrelated from one spatial patch to another. In this section we 
analyze the Saskatoon experiment without making any of these assumptions. The reported 
error bars (the diagonal variance of the noise) and the reported off-diagonal correlations 
replace the first two assumptions. We perform the analysis in the context of a standard 
cold dark matter (CDM) model with a Harrison- Zel'dovich-Peebles initial spectrum, which 
completely determines the correlations between patches. 

The Saskatoon experiment takes measurements at six different channels for each patch: 
three frequency channels and two polarization channels. We consider the so-called "East" 
data set, which consists of measurements in 21 separate patches (the "West" data set gives 
very similar answers). The theory gives the predicted variance not only in a given patch, 
but also the correlations between different patches. Specifically, 
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where a, b label different patches; C\ is the prediction of the theory for the / th mulitple 
moment; and Wi >a b is the window function of the experiment which depends on the chop- 
ping strategy, beam width, and spatial separation of patches a and b. For comparison, 
we previously assumed (d a db) = d a ba^ h . CDM has only one free parameter, so C1/C2 is 
fixed for all / > 2. Figure 1 shows the likelihood ratio 1Z for the Saskatoon East data as a 
function of Q TUl s = a/SC^Mtt- 

The two curves in Figure 1 correspond to the likelihood ratio with and without off- 
diagonal noise correlations. The difference is stunning. A detection which would have been 
extremely clean [72. (C2) ~ 10 11 ] becomes much less certain [72. (6*2) ~ 30] once correlations 
are accounted for. (Note that Eq. (16) actually underestimates the effects of noise corre- 
lations in this case: a^ h /a^ = .68 and e ~ 1/3, so that we expect 72/72o ~ 5 x 10~ 8 . One 
reason for this is that the off-diagonal noise elements were chosen to be equal in the simple 
model leading up to Eq. (16) , whereas in any real experiment, certain channels will be 
more strongly correlated than others.) We emphasize that the Saskatoon experiment did 
include this effect in their analysis; we use this experiment as an illustration because it 
is the only one of which we are aware that has reported the presence (or lack of) noise 
correlations. 

In summary, even relatively small amounts of correlated noise between frequency chan- 
nels can greatly affect the interpretation of microwave background experiments. The effect 
is likely to be of particular importance for those experiments which utilize a common part 
of the electronic signal path for multiple frequency channels (as in the Saskatoon experi- 
ment considered here), or for experiments with substantial atmospheric or environmental 
sensitivity. We hope the arguments presented here will prompt other groups to examine 
and report noise correlations in their experiments. 

We are especially grateful to Lyman Page and Stephan Meyer for their patient ex- 
planation of the Saskatoon experiment, and to them and Michael Turner for other useful 
discussions. This work was supported in part by the DOE (at Chicago and Fermilab) and 
by NASA through grant No. NAGW-2381 (at Fermilab). AK is supported in part by the 
NASA Graduate Student Researchers Program. 
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Figure Caption 

Figure 1. The Likelihood Ratio vs. Q rms for the East data of the Saskatoon experiment 
with and without noise correlations. Q rms is related to C2 via Q^ ms = 56*2 /47r. 
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This figure "figl-l.png" is available in "png" format from: 



http://arXiv.org/ps/astro-ph/9402053v2 



